Index

A  B  C  D  E  F  G  H  I  J  K  L  M  N  O  P  R  S  T  U  W  X  Y  Z 

A

access drivers, 6.1.1
ACCESS PARAMETERS clause, 6.4.5
special characters, 7
syntax rules, 7
ACCESS PARAMETERS Clause
syntax, 7
activity reports, 2.12.3
ALL_HIVE_COLUMNS view, 6.3.2.2, 7
ALL_HIVE_DATABASES view, 7
ALL_HIVE_TABLES view, 6.3.2.1, 7
Apache Sentry, 2.11.3
application adapters, 1.5.3.3
applications
data pull, 4.1, 4.1.1
data push, 4.1.2
array overflows, 7
Audit Vault
plug-in configuration, 6.1.4
Audit Vault plug-in, 2.12.2
auditing data collected from services, 2.12.1
authentication, 3.1
authorization, 2.11.3
autoAnalyze configuration property, 5.6, 5.12
autoAnalyze property, 5.4.3.2
autoBalance configuration property, 5.6, 5.12
Automated Service Manager
See OASM

B

BALANCER_HOME environment variable, 5.3, 5.3
bdadiag utility, 2.13
Berkeley DB, 1.4.3
best practices, 5.1
big data description, 1.1
binary overflows, 7
business intelligence, 1.2, 1.4, 1.6
byteWeight configuration property, 5.12

C

catalog views, 7
CDH
about, 1.3
diagnostics, 2.13
file system, 1.4.1
remote client access, 3.2
security, 3.1
version, 2.5.1
character overflows, 7
chopped keys, 5.12
chunking files, 1.4.1
client access
HDFS cluster, 3.2.4
HDFS secured cluster, 3.2.5
Hive, 3.3
client configuration, 3.2
Cloudera Manager
about, 2.2
accessing administrative tools, 2.2.2
connecting to, 2.2
effect of hardware failure on, 2.7.5
software dependencies, 2.7.5
starting, 2.2
UI overview, 2.2.1
version, 2.5.1
Cloudera's Distribution including Apache Hadoop
See CDH
clusters, definition, 1.3
column mapping, 7
common directory, 6.7.2
com.oracle.bigdata.colmap, 7
com.oracle.bigdata.datamode, 7
com.oracle.bigdata.erroropt, 7
com.oracle.bigdata.fields, 7
com.oracle.bigdata.fileformat, 7
com.oracle.bigdata.log.exec, 7
com.oracle.bigdata.log.qc, 7
com.oracle.bigdata.overflow, 7
com.oracle.bigdata.rowformat, 7
com.oracle.bigdata.tablename, 7
confidence configuration property, 5.12
Counting Reducer, 5.1.3
CREATE TABLE ORGANIZATION EXTERNAL syntax, 6.3.1, 6.4
CREATE TABLE statement
generating automatically for Hive, 7
CREATE_EXTDDL_FOR_HIVE function, 6.3.2.2
syntax, 7

D

data dictionary views, 7
data mode, 7
data replication, 1.4.1
data skew, 5.1
data source name, 7
data type conversion (Big Data SQL), 6.5
data types (HDFS), 7
DataNode, 2.7.2
dba group, 2.11.1
DBA_HIVE_COLUMNS view, 7
DBA_HIVE_DATABASES view, 7
DBA_HIVE_TABLES view, 7
DBMS_HADOOP package, 6.3.2.2, 7
DBMS_OUTPUT package, 6.3.2.2
DEFAULT DIRECTORY clause, 6.4.2
delimited text files, 7
diagnostics, collecting, 2.13
disks, 2.7.1
dnsmasq service, 4.4
duplicating data, 1.4.1

E

emcli utility, 2.1.2
enableSorting configuration property, 5.12
encryption, 2.11.4
engineered systems, 1.2
error handling, 7
error handling (Big Data SQL), 6.6.2
Exadata Database Machine, 1.2
Exadata InfiniBand connections, 4.3
Exalytics In-Memory Machine, 1.2
External table clause, 6.4
external tables, 1.5.3.1
about, 6.1.1

F

failover
JobTracker, 2.6.5
NameNode, 2.6.4
feedbackDir configuration property, 5.12
field extraction, 7
field names, 7
files, recovering HDFS, 3.5
first NameNode, 2.7.3
Flume, 2.5.2, 2.11.1
ftp.oracle.com, 2.13

G

groups, 2.11.1, 3.4

H

Hadoop Distributed File System
See HDFS
hadoop group, 3.4
Hadoop log files, 7
Hadoop version, 1.3
HADOOP_CLASSPATH environment variable, 5.3, 5.11.1
HADOOP_USER_CLASSPATH_FIRST environment variable, 5.3
HBase, 2.5.2, 2.11.1
HDFS
about, 1.3, 1.4.1
auditing, 2.12.1
user identity, 2.11.1
HDFS files, 6.3.3
help from Oracle Support, 2.13
Hive, 2.11.1
about, 1.4.2
auditing, 2.12.1
client access, 3.3
node location, 2.7.6
software dependencies, 2.7.5
tables, 3.4.1
user identity, 2.11.1, 2.11.1, 2.11.1
Hive columns, 7
Hive data
access from Oracle Database, 6.3.2
Hive databases, 7
hive group, 3.4
Hive table sources, 7
Hive tables, 7
Hive views, 7
HiveQL, 1.4.2
HotSpot
See Java HotSpot Virtual Machine
Hue, 2.7.6
user identity, 2.11.1
users, 3.4.1

I

Impala, 2.5.2
InfiniBand connections to Exadata, 4.3
InfiniBand network configuration, 4
inputFormat.mapred.* configuration properties, 5.12
installing CDH client, 3.2

J

Java HotSpot Virtual Machine, 2.5.1
JDBC client, configuring for SDP, 4.6
Job Analyzer, 5.1.3, 5.4.1
job duration, 5.1
jobconfPath property, 5.12
jobHistoryPath configuration property, 5.12
JobTracker
failover, 2.6.5
security, 3.1
user identity, 2.11.1
JobTracker node, 2.7.5

K

Kerberos authentication, 3.1
Kerberos commands, 3.1
Kerberos user setup, 3.4.1.2
key chopping, 5.1.1
keyLoad.minChopBytes configuration property, 5.12
keys, assigning to reducers, 5.1.1
key-value database, 1.4.3
keyWeight configuration property, 5.12
knowledge modules, 1.5.3.3

L

linearKeyLoad properties, 5.4.3.2
linearKeyLoad.* configuration properties, 5.12
Linux
disk location, 2.7.1
installation, 2.5.1
load, 5.1
Load Balancer, 5.1.3
loading data, 1.5.3.1, 1.5.3.2
LOCATION clause, 6.4.3
log files, 7
login privileges, 3.4.2

M

mapper workload, 5.1.1
mapred configuration properties, 5.12
mapred user, 2.11.1
mapred.map.tasks configuration property, 5.12
MapReduce, 1.3, 1.5.1, 2.12.1, 3.1, 3.4.1
mapreduce configuration properties, 5.12
map.tasks property, 5.12
maxLoadFactor configuration property, 5.12
maxSamplesPct configuration property, 5.12
max.split.size configuration property, 5.12
minChopBytes configuration property, 5.12
minSplits configuration property, 5.12
monitoring activity, 2.12.3
multirack clusters
service locations, 2.6.2
MySQL Database
about, 2.7.5
port number, 2.11.5
user identity, 2.11.1
version, 2.5.1

N

NameNode, 3.1
first, 2.7.3
NameNode failover, 2.6.4
Navigator, 2.5.2
NoSQL databases
See Oracle NoSQL Database
numThreads configuration property, 5.12

O

OASM, port number, 2.11.5
ODI
See Oracle Data Integrator
oinstall group, 2.11.1, 3.4
on-disk encryption, 2.11.4
Oozie, 2.7.6
auditing, 2.12.1
software dependencies, 2.7.5, 2.7.5
software services, 2.11.1
user identity, 2.11.1
openib.conf file, 4.5
operating system users, 2.11.1
Oracle Audit Vault and Database Firewall, 2.12
plug-in configuration, 6.1.4
Oracle Automated Service Manager
See OASM
Oracle Big Data SQL
access drivers, 6.1.1
data type conversion, 6.5
general description, 1.5.2, 6.1
installation changes on Oracle Exadata Machine, 6.7
security, 6.1.4
Oracle Data Integrator
about, 1.5.3.1, 1.5.3.3
node location, 2.7.6
software dependencies, 2.7.5
version, 2.5.1
Oracle Data Integrator agent, 2.11.5
Oracle Database
access to Hive data, 6.3.2
HDFS file access, 6.3.3
Oracle Database Instant Client, 2.5.1
Oracle Exadata Database Machine, 1.2, 4
using as a CDH client, 3.2.2
Oracle Exadata Machine
Big Data SQL installation changes, 6.7
Oracle Exalytics In-Memory Machine, 1.2
Oracle Linux
about, 1.3
relationship to HDFS, 1.3
version, 2.5.1
Oracle Loader for Hadoop, 1.5.3.2, 2.5.1
Oracle NoSQL Database
about, 1.4.3, 1.5.3.4
port numbers, 2.11.5
version, 2.5.1
Oracle R Advanced Analytics for Hadoop, 1.5.3.5, 2.5.1
Oracle R Enterprise, 1.5.4
Oracle SQL Connector for HDFS, 1.5.3.1
Oracle Support, creating a service request, 2.13
oracle user, 2.11.1, 3.4
Oracle XQuery for Hadoop, 1.5.3.4, 2.5.1
ORACLE_HDFS access driver, 6.3.3, 6.3.3.1
ORACLE_HIVE
access parameters, 7
ORACLE_HIVE access driver, 6.3.2.3
ORACLE_HIVE examples, 6.3.2.3
oracle.hadoop.balancer.* configuration properties, 5.12
oracle.hadoop.balancer.autoAnalyze configuration property, 5.6
oracle.hadoop.balancer.autoAnalyze property, 5.4.3.2
oracle.hadoop.balancer.autoBalance configuration property, 5.6
oracle.hadoop.balancer.Balancer class, 5.10
oracle.hadoop.balancer.KeyLoadLinear class, 5.12, 5.12
oracle.hadoop.balancer.linearKeyLoad.* properties, 5.4.3.2
ORC files, 7
out of heap space errors, 5.9
overflow handling, 7

P

Parquet files, 7
parsing HDFS files, 7
partitioning, 2.7.1, 5.1.1
Perfect Balance
application requirements, 5.2
basic steps, 5.3
description, 5.1
planning applications, 1.2
PL/SQL packages, 7
port map, 2.11.5
port numbers, 2.11.5, 2.11.5
pulling data into Exadata, 4.1, 4.1.1
puppet
port numbers, 2.11.5
security, 2.11.6
user identity, 2.11.1
puppet master
node location, 2.7.3
pushing data into Exadata, 4.1.2
PUT_LINE function, 6.3.2.2

R

R Connector
See Oracle R Advanced Analytics for Hadoop
R distribution, 2.5.1
R language support, 1.5.4
range partitioning, 5.1.1
RC files, 7
recovering HDFS files, 3.5
reducer load, 5.1
REJECT LIMIT clause, 6.4.4
remote client access, 3.2, 3.3
replicating data, 1.4.1
report.overwrite configuration property, 5.12
reportPath configuration property, 5.12
resource management, 2.5.3, 2.10.2
row format description, 7
row formats, 7
rowWeight configuration property, 5.12
rpc.statd service, 2.11.5

S

SDP listener configuration, 4.7
SDP over InfiniBand, 4
SDP, enabling on Exadata, 4.5
Search, 2.5.2
security, 2.11
Sentry, 2.11.3
sequence files, 7
SerDe parsing, 7
service requests, creating for CDH, 2.13
service tags, 2.11.5
services
auditing, 2.12.1
node locations, 2.6.1
skew, 5.1
SmartScan, 6.1.3
SmartScan mode, 7
Sockets Direct Protocol, 4.1
software components, 2.5.1
software framework, 1.3
software services
node locations, 2.6.1
port numbers, 2.11.5
source name, 7
Spark, 2.5.2
Sqoop, 2.5.2, 2.11.1
ssh service, 2.11.5
static data dictionary views, 7
struct overflows, 7
svctag user, 2.11.1

T

tables, 1.5.3.1, 1.5.3.2, 3.4.1
TaskTracker
user identity, 2.11.1
text files, 7
text overflows, 7
tmpDir configuration property, 5.12
tools.* configuration properties, 5.12
trash facility, 3.5
trash facility, disabling, 3.5.3.1
trash interval, 3.5.2
troubleshooting CDH, 2.13
TYPE clause, 6.4.1

U

union overflows, 7
uploading diagnostics, 2.13
useClusterStats configuration property, 5.12
useMapreduceApi configuration property, 5.12
user access from Oracle Database, 6.6
user accounts, 3.4.1
user groups, 3.4
USER_HIVE_COLUMNS view, 7
USER_HIVE_DATABASES view, 7
USER_HIVE_TABLES view, 7
users
Cloudera Manager, 2.2.2
operating system, 2.11.1

W

writeKeyBytes configuration property, 5.12

X

xinetd service, 2.11.5
XQuery connector
See Oracle XQuery for Hadoop

Y

YARN support, 1.5.1

Z

ZooKeeper, 2.11.1